Measuring Coverage of a Valency Lexicon using Full Syntactic Analysis
Abstract
Recent developments have shown that valency information is of great benefit in many areas of natural language processing. Building valency lexicons is, however, a complex and time-consuming task from both theoretical and practical points of view, since the design of the lexicon, as well as its careful and considered preparation, plays a crucial role in its future usability. As with any manually created resource, its quality is difficult to evaluate. In this paper we consider using the syntactic parser synt to estimate the coverage of the VerbaLex verb valency lexicon for Czech. For this task we extended the phrase extraction functionality of the parser, which we describe briefly. Finally, we discuss our results and further development.
Similar resources
A Syntactic Valency Lexicon for Persian Verbs: The First Steps towards Persian Dependency Treebank
Valency lexicons are valuable resources for natural language processing. The need for new resources for languages encourages researchers to collect new datasets, and valency lexicons are among the most important of them. In valency lexicons, information about obligatory and optional complements of words is annotated at the syntactic and semantic levels. In this paper, we report the development of ...
Building a Large Lexicon of Complex Valency Frames
This paper describes the process of building and using a new comprehensive lexicon of Czech verb valency frames based on complex valency frames. The main features of the lexicon entries are designed to bring important semantic information to computer processing of predicate constructions in running texts. The most notable features include two-level semantic labels with linkage to the Princeton ...
Platform for Full-Syntax Grammar Development Using Meta-grammar Constructs
This paper describes a combination of tools necessary for full or deep syntactic parsing of natural language – the syntactic parser synt, the graphical Grammar Development Workbench, GDW and the VerbaLex verb valency lexicon tools. We describe the development of the mentioned tools and how they integrate into one system that allows a team of experts (computational linguists as well as programme...
Exploitation of the VerbaLex Verb Valency Lexicon in the Syntactic Analysis of Czech
This paper presents an exploitation of the lexicon of verb valencies for the Czech language named VerbaLex. The VerbaLex lexicon format, called complex valency frames, comprehends all the information found in three independent electronic dictionaries of verb valency frames and it is intensively linked to the Czech WordNet semantic network. The NLP laboratory at FI MU Brno develops a deep syntac...
Towards Automatic Extraction of Argument Structure from Corpora
The valency of predicates is a key component of a lexical entry because most, if not all, recent syntactic theories 'project' syntactic structure from such information in the lexicon (e.g. Pollard & Sag, 1987). Therefore, a wide-coverage robust parser utilising a grammar based on one of these theories must have access to an accurate dictionary encoding (at a minimum) valency information and prob...